Naïve Bayes technique for diatoms classification with discretised input
نویسندگان
چکیده
The challenge to discover knowledge from environmental data that has led to usage of methods and techniques such as data mining tools, can bridge the knowledge gap between the biological experts and organisms. This research aimed to assess relationships between the diatoms and the indicators of the environment with Naïve Bayes method. Diatoms are ideal indicators of certain physical-chemical parameters and they can be classified into one of the water quality classes (WQCs). The classification models are induced by using Naïve Bayes technique. The input dataset that is supplied for the naïve Bayes method is discretised. Based on the evaluation results, several models are presented and discussed. The obtain results from the models are verified with existing diatom ecological preference and for some diatoms new knowledge is added. To best of our knowledge, this is the first time the prosed method to be applied for diatom classification of any ecosystem.
منابع مشابه
Enhanced Naïve Bayes Algorithm for Intrusion Detection in Data Mining
Classification is a classic data mining technique based on machine learning. Classification is used to classify each item in a set of data into one of predefined set of classes or groups. Naïve Bayes is a commonly used classification supervised learning method to predict class probability of belonging. This paper proposes a new method of Naïve Bayes Algorithm in which we tried to find effective...
متن کاملDecision Tree Induction 17.1 Introduction 17.2 Attribute selection measure 17.3 Tree Pruning 17.4 Extracting Classification Rules from Decision Trees 17.5 Bayesian Classification 17.6 Bayes Theorem 17.7 Naïve Bayesian Classification 17.8 Bayesian Belief Networks
متن کامل
Naïve Bayes Classifier with Various Smoothing Techniques for Text Documents
Due to huge amount of increase in text data, its classification has become an important issue, now days. There are many good classification techniques discussed in this paper. Each classification method has its own assumptions, advantages and limitations. One of the most widely used classifier is Naïve Bayes which performs well with different data sets. Various Smoothing techniques are applied ...
متن کاملPrivacy Preserving Naïve Bayes Classifier for Horizontally Distribution Scenario Using Un-trusted Third Party
The aim of the classification task is to discover some kind of relationship between the input attributes and the output class, so that the discovered knowledge can be used to predict the class of a new unknown tuple. The problem of secure distributed classification is an important one. In many situations, data is split between multiple organizations. These organizations may want to utilize all ...
متن کاملPerformance Comparison of Naïve Bayes and J48 Classification Algorithms
Classification is an important data mining technique with broad applications. It classifies data of various kinds. Classification is used in every field of our life. Classification is used to classify each item in a set of data into one of predefined set of classes or groups. This paper has been carried out to make a performance evaluation of Naïve Bayes and j48 classification algorithm. Naive ...
متن کامل